Goto

Collaborating Authors

 feature heterophily


A Generative Model for Controllable Feature Heterophily in Graphs

Wang, Haoyu, Ma, Renyuan, Mateos, Gonzalo, Ruiz, Luana

arXiv.org Machine Learning

ABSTRACT We introduce a principled generative framework for graph signals that enables explicit control of feature heterophily, a key property underlying the effectiveness of graph learning methods. Our model combines a Lipschitz graphon-based random graph generator with Gaussian node features filtered through a smooth spectral function of the rescaled Laplacian. We establish new theoretical guarantees: (i) a concentration result for the empirical heterophily score; and (ii) almost-sure convergence of the feature heterophily measure to a deterministic functional of the graphon degree profile, based on a graphon-limit law for polynomial averages of Laplacian eigenvalues. Index T erms-- graph generative models, homophily, graphons 1. INTRODUCTION The success of many graph information processing problems, including node-level tasks in graph machine learning [1, 2] and network topology inference [3-5], hinges on the alignment between graph topology and node features, often summarized by the notion of homophily or heterophily. We develop a generative framework for graphs and node features (i.e., graph signals) that allows explicit control of feature het-erophily in the range from homophily to heterophily.


On the Impact of Feature Heterophily on Link Prediction with Graph Neural Networks

Neural Information Processing Systems

Heterophily, or the tendency of connected nodes in networks to have different class labels or dissimilar features, has been identified as challenging for many Graph Neural Network (GNN) models. While the challenges of applying GNNs for node classification when class labels display strong heterophily are well understood, it is unclear how heterophily affects GNN performance in other important graph learning tasks where class labels are not available. In this work, we focus on the link prediction task and systematically analyze the impact of heterophily in node features on GNN performance. We first introduce formal definitions of homophilic and heterophilic link prediction tasks, and present a theoretical framework that highlights the different optimizations needed for the respective tasks. We then analyze how different link prediction encoders and decoders adapt to varying levels of feature homophily and introduce designs for improved performance.